Program Performance Michelle

نویسنده

  • Michelle Mills Strout
چکیده

Many scienti c applications require sparse matrix computations. For example, Finite Element modeling and N-body simulations. It is di cult to write these codes in a portable way which also achieves high performance because of the sparsity of the matrices and because current architectures have deep memory hierarchies and multiple levels of parallelism. Therefore the implementation of such computations become obfuscated because of the hand tuning necessary to get performance on a speci c architecture. Three performance aspects which must be dealt with are the matrix sparsity, data locality, and parallelism. Typically less than 1% of the entries in the matrix are non-zero [PS98], therefore it is necessary to use sparse data structures which only store the non-zeros. There are many di erent sparse data formats which save space and computation time for matrices with certain characteristics. Deep memory hierarchies and large relative memory latencies suggest the need for data locality optimizations which take advantage of data reuse. Parallelism allows for bigger problems to be solved. Current work in this problem domain has either looked at separating the algorithm speci cation for the sparse data structure speci cation, or looked at locality and parallelism transformations for sparse computations which use speci c sparse data structures. We would like to propose having locality and parallelism transformations be able to deal with any possible combination of sparse matrix computation and sparse data structure.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Analysis of Iran’s National Medical Education Evolution and Innovation Plan using the Michelle and Scott’s model of policymaking

Introduction: Lack of a clear policy for the development of healthhuman resources has created inconsistencies. These imbalancesare threats to the health system to achieve its goals. Therefore, thedevelopment of human resources through proper performance ofhigher education health system is an important part of the policydevelopment process of the health sector. The present paper aimsto introduce...

متن کامل

Performance Guidance Despite Optimization Interaction

Improved application performance allows for computational analysis of bigger problems, saves money spent on computation resources, and handles more demanding multimedia applications. Having the compiler make all performance guidance solutions would be ideal. This allows performance to be dealt with separately from functionality, which is useful because it avoids having aspects of performance ob...

متن کامل

Clinical evaluation of the Nucleus® 6 cochlear implant system: Performance improvements with SmartSound iQ

OBJECTIVE This paper provides a detailed description of the Nucleus 6 system, and clinically evaluates user performance compared to the previous Nucleus 5 system in cochlear implant recipients. Additionally, it clinically evaluates a range of Nucleus 6 and Nucleus 5 programs to determine the performance benefits provided by new input processing technologies available in SmartSound iQ. DESIGN ...

متن کامل

“Management Structure and Bank Performance: An Empirical Examination of the early Republican Chinese Banking Industry.”

This paper investigates the relationship between bank performance and management structure in Shanghai between 1912 and 1937. The data come from the Shanghai's Municipal Archives and the Center for Research of Chinese Business History. Using panel data econometric technique, the analysis finds that management structure does little to explain bank performance. Rather, factors such as bank asset ...

متن کامل

Choosing a Shortest Path Algorithm

Computation of shortest paths is an integral component of many applications such as transportation planning and VLSI design. Frequently, a shortest path algorithm is selected for a given application based on the performance of the algorithm for a set of test networks. The performance of this algorithm, however, can be signiicantly diierent for networks not included in the test set. Therefore, i...

متن کامل

Technical Report The CGPOP Miniapp , Version 1 . 0

Miniapps provide performance proxies for larger applications thus enabling easier evaluation of performance tuning and refactoring techniques. The CGPOP miniapp is the conjugate gradient solver from Los Alamos National Laboratory’s Parallel Ocean Program (POP) version 2.0. This paper describes the conjugate gradient algorithm encapsulated in CGPOP, presents the organization of the CGPOP 1.0 rel...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000